Data sampling for improved speech recognizer training

نویسندگان

Takahiro Shinozaki

Mari Ostendorf

Les E. Atlas

چکیده

Proper data selection for training a speech recognizer can be important for reducing costs of developing systems on new tasks and exploratory experiments, but it is also useful for efficient leveraging of the increasingly large speech resources available for training large vocabulary systems. In this work, we investigate various sampling methods, comparing the likelihood criterion to new acoustic measures motivated by work in child language acquisition. The acoustic criteria can be used with or without pre-existing transcriptions or models. When applied to the problem of selecting a small training set, the best results are obtained using modulation spectrum features and a discriminant function trained on child vs. adult-directed speech. For large corpora, none of the methods outperforms random sampling, but reduced training costs are obtained by using multistage training and initializing with the small corpus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UNSUPERVISED TRAINING OF A SPEECH RECOGNIZER : RECENTEXPERIMENTSThomas

Current speech recognition systems require large amounts of transcribed data for parameter estimation. The transcription , however, is tedious and expensive. In this work we describe our experiments which are aimed at training a speech recognizer with only a minimal amount (30 minutes) of transcriptions and a large portion (50 hours) of un-transcribed data. A recognizer is bootstrapped on the t...

متن کامل

ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition

One approach to robust speech recognition is to use a simple speech model to remove the distortion, before applying the speech recognizer. Previous attempts at this approach have relied on unimodal or point estimates of the noise for each utterance. In challenging acoustic environments, e.g., an airport, the spectrum of the noise changes rapidly during an utterance, making a point estimate a po...

متن کامل

Unsupervised training of a speech recognizer: recent experiments

Current speech recognition systems require large amounts of transcribed data for parameter estimation. The transcription, however, is tedious and expensive. In this work we describe our experiments which are aimed at training a speech recognizer with only a minimal amount (30 minutes) of transcriptions and a large portion (50 hours) of untranscribed data. A recognizer is bootstrapped on the tra...

متن کامل

Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer

In this paper we address the problem of building a good speech recognizer if there is only a small amount of training data available. The acoustic models can be improved by interpolation with the well-trained models of a second recognizer from a different application scenario. In our case, we interpolate a children’s speech recognizer with a recognizer for adults’ speech. Each hidden Markov mod...

متن کامل

Unsupervised training of a speech recognizer using TV broadcasts

Current speech recognition systems require large amounts of transcribed data for parameter estimation. The transcription, however, is tedious and expensive. In this work we describe our experiments which are aimed at training a speech recognizer without transcriptions. The experiments were carried out with TV newscasts, that were recorded using a satellite receiver and a simple MPEG coding hard...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Data sampling for improved speech recognizer training

نویسندگان

چکیده

منابع مشابه

UNSUPERVISED TRAINING OF A SPEECH RECOGNIZER : RECENTEXPERIMENTSThomas

ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition

Unsupervised training of a speech recognizer: recent experiments

Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer

Unsupervised training of a speech recognizer using TV broadcasts

عنوان ژورنال:

اشتراک گذاری